Logical partitioning of a physical device

ABSTRACT

In one embodiment, an indication of a fault condition is received relating to a first service running on a physical device in a computer network. The first service is associated with a first virtual device context defined on the physical device. Then, the first service is disabled without affecting operation of a second service on the physical device. The second service is associated with a second virtual device context defined on the physical device. In another embodiment, a first virtual device context is created on a physical device in a computer network. Then, a second virtual device context is created on the physical device. The first virtual device context may then be managed independently of the second virtual device context such that resources assigned to a virtual device context are managed without affecting management of another virtual device context.

BACKGROUND

1. Technical Field

The present disclosure relates to computer networking.

2. Description of the Related Art

Next generation network devices may be designed with multiple technologies embedded into a single device. For example, a device may designed with storage, Ethernet switching, and Ethernet routing. There may also be multiple protocols supported within each device technology. For example, multiple Ethernet protocols may be supported by the single device. Even though all of these technologies are hosted in one device, they participate in completely independent networks.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a diagram illustrating an example of a virtualization architecture.

FIG. 2 is a diagram illustrating an example of using Virtual Device Contexts (VDCs) for high availability.

FIG. 3 is a diagram illustrating an example of using VDCs for improved hardware resource utilization.

FIG. 4 is a flow diagram illustrating an example of a method for operating a physical device in a computer network.

FIG. 5 is a flow diagram illustrating another example of a method for operating a physical device in a computer network.

FIG. 6 is a simplified block diagram illustrating an example of a router or switch on which one or more of the processes described above may be run.

DESCRIPTION OF EXAMPLE EMBODIMENTS Overview

In one embodiment, an indication of a fault condition is received relating to a first service running on a physical device in a computer network. The first service is associated with a first virtual device context defined on the physical device. Then, the first service is disabled without affecting operation of a second service on the physical device. The second service is associated with a second virtual device context defined on the physical device.

In another embodiment, a first virtual device context is created on a physical device in a computer network. Then, a second virtual device context is created on the physical device. The first virtual device context may then be managed independently of the second virtual device context such that resources assigned to a virtual device context are managed without affecting management of another virtual device context.

Example Embodiments

In this application, numerous specific details are set forth in order to provide a thorough understanding of the present invention. It will be obvious, however, to one skilled in the art, that the present invention may be practiced without some or all of these specific details. In other instances, well known process steps have not been described in detail in order to not obscure the present invention.

As devices such as data center devices, which incorporate numerous different technologies into individual devices, progress towards a more service oriented network environment, it may be beneficial to utilize network devices as a resource that can be partitioned based on service requirements. Even though different services co-exist on a physical device, each service can often have different requirements for fault isolation, management isolation, as well as resource isolation and allocation.

In hosted environments when multiple administrators are managing a single physical device, co-ordination must occur between all of the administrators before changing any configuration. A misconfiguration by one administrator could bring down the entire device, affecting resources outside of the control and/or purview of the administrator.

Furthermore, network switches and routers are traditionally designed to provide high availability by having redundant hardware components and running software services in a hot standby mode. This redundancy model attempts to immediately switch to standby hardware/software in case of a fault. A kernel crash, file system corruption, or other software crash in one of the software components causes all of the services on the physical device to be passed to the standby hardware/software. In the case where no standby is available, however, the entire physical switch is reset, which interrupts the processing of all of the services on the physical switch.

A redundant supervisor model may allow a physical switch to remain operational in the event of a failure in one of its services. In this model, multiple supervisor processes run on a single physical device. One of the supervisors is active, and controls the various services available on the device. If a failure should occur, the supervisor may go down, and then the standby supervisor is activated. This allows services to remain active even though they reside on the same physical switch as a service that has gone down. However, in the case where the failure occurs do to an external issue, such as a fault caused by an external device or based on corrupted network topology, the standby supervisor will suffer the same failure as the original supervisor.

Therefore, virtualization may be provided to allow for the logical partitioning of a physical device into multiple partitions. This can be used in lieu of a redundant supervisor model and provides protection even in the case of a failure caused by an external issue. The logical partitioning also addresses another issue. Some of the services available on the device are applied to all the interfaces (unless particular interfaces are specified). One such example is Spanning Tree Protocol (STP) where STP either can be running on a per VLAN Rapid Spanning Tree (PVRST) or on Multiple Spanning Tree (MST) mode for the entire switch. If the network topology is such that one partition (part?) of the network can run PVRST because of a small number of VLANs and another part of the network needs to run MST due to scalability, it becomes possible to do so in a single device by utilizing logical partitioning.

A virtual device context (VDC) is a way to partition a single physical device into multiple logical devices to provide fault isolation, management isolation, address allocation isolation, service differentiation domains, adaptive resource management, and other service isolation. VDCs allow each instance within a physical device to be managed independently from each other. Each VDC may be carved out with certain resources allocated to it by a supervisor-user. Once the resources are assigned, in some implementations they may be managed by administrators of that VDC only.

FIG. 1 is a diagram illustrating an example of a virtualization architecture. A single physical box 100 may contain multiple VDCs 102 a, 102 b. In this example, VDC 102 a contains several layer 2 protocols 104 and a routing protocol 106. The routing protocol may be used along with Routing Information Bases (RIBs, also called routing tables) 108 and a protocol stack 110. VDC 102 b contains a routing protocol 112 that may or may not be the same as the routing protocol 106 for VDC 102 a, along with RIB tables 114 and a protocol stack 116. Both VDC's may share certain software on the physical box, including infrastructure 118 and kernal 120.

A fault in one of the services of VDC 102 a would not affect the operation of VDC 102 b (and vice-versa). Likewise, an administrator could upgrade VDC 102 a without affecting the operation of VDC 102 b (and vice-versa).

For purposes of this document, when a VDC is said to be “defined on” or “created on” a physical device, this shall be interpreted to mean that the VDC is set up to operate on the physical device as a logical component of the device. While in many embodiments this may involve the storage of VDC configuration information in memory on the physical device, embodiments are also possible wherein the configuration or other “definition” information for the VDC are stored elsewhere than the physical device. The terms “defined on” and “created on” shall be interpreted to encompass all of these different embodiments.

FIG. 2 is a diagram illustrating an example of using VDCs for high availability. Here, two physical devices 200 a, 200 b are utilized for redundant supervisor control. Each physical device 200 a, 200 b has a supervisor service 202 a, 202 b. The supervisor services 202 a, 202 b may both be active at the same time. The VDCs within each physical device 200 a, 200 b allow, for example, supervisor service 202 a to remain active even though a service in VDC1 204 has gone down. It is even possible to fail over the failed VDC1 204 to a different physical device while VDC2 206 and VDCn 208 remain on physical device 200 a and supervisor 202 a remains active.

FIG. 3 is a diagram illustrating an example of using VDCs for improved hardware resource utilization. Here, VDCs set up on multiple physical devices 300 a, 300 b can be selectively utilized to best allocate available hardware resources. For example, VDC1 302 and VDCn 304 may be operated on physical device 300 b while VDC2 306 may be operated on physical device 300 a. Should circumstances change, it may then be more efficient to switch operation over VDC1 302 to physical device 300 a, which can be done without causing any effect on VDCn 304 or any other VDCs running on physical device 300 b.

FIG. 4 is a flow diagram illustrating an example of a method for operating a physical device in a computer network. The physical device may be any type of network device, but in one embodiment the physical device is a router or switch. At 400, an indication of a fault condition relating to a first service running on the physical device is received. The first service is associated with a first virtual device context defined on the physical device. The first virtual device context and the second virtual device context may each be associated with one or more different types of services, including, for example, layer 2 services, layer 3 services, and storage area network services. In one embodiment, each of the first virtual device context and second virtual device context are associated with at least one layer 2 service. These layer 2 services may be utilizing the same or different protocols. In another embodiment, the first virtual device context represents a first supervisor process and the second virtual device context represents a second supervisor process, wherein both the first supervisor process and the second supervisor process are active at the same time. In yet another embodiment, one of the virtual device contexts acts as an active supervisor and another of the virtual device contexts acts as a backup supervisor. The physical device may have a plurality of interfaces and these interfaces may be assigned to various virtual device context. In this embodiment, at 402, the first service is assigned a first subset of the plurality of interfaces and at 404, the second service is assigned a second subset of the plurality of interfaces, wherein the first subset and the second subset have no overlapping interfaces. At 406, the first service running on the physical device is disabled without affecting operation of a second service on the physical device. The second service is associated with a second virtual device context defined on the physical device. It should be noted that the phrase “without affecting operation of” throughout this document should be interpreted to mean “without causing a failure or significant reduction in efficiency of.”

FIG. 5 is a flow diagram illustrating another example of a method for operating a physical device in a computer network. The physical device may be any type of network device, but in one embodiment the physical device is a router or switch. At 500, a first virtual device context is created on the physical device. The first virtual device context is a logical grouping of one or more services. At 502, a second virtual device context is created on the physical device. The second virtual device context is another logical grouping of one or more services. The first virtual device context is capable of being managed independently of the second virtual device context such that resources assigned to a virtual device context are capable of being managed without affecting management of another virtual device context. It should be noted that the phrase “without affecting management of” throughout this document should be interpreted to mean “without causing a significant effect on control or operation of.”

At 504, the first virtual device context may be utilized on the physical device. Several alternative embodiments are possible is this step. In one embodiment, an Active-Active high availability redundancy model may be configured using the first virtual device context and the second virtual device context by defining both the first virtual device context and the second virtual device contexts as active redundant supervisors. In another embodiment, the first virtual device context is configured to act as an active supervisor while the second virtual device context may be configured to act as a backup supervisor. In another embodiment, the second virtual device context may be assigned to another physical device without affecting operation of the first virtual device context. In another embodiment, the physical device may be operated according to the process described in FIG. 4 and the accompanying text above. In another embodiment, the first virtual device context may be upgraded without affecting operation of the second virtual device context.

FIG. 6 is a simplified block diagram illustrating an example of a router or switch on which one or more of the processes described above may be run. The router or switch 600 may have one or more interfaces 602 and a memory 604. The router or switch 600 may then also have a processor 604 that may be configured to perform the above-described processes and store virtual device context information in the memory 604. Other information relating to the processes described above may also be stored in the memory 604 by the processor 606.

Embodiments are also envisioned wherein the router or switch comprises one or more line cards, wherein each line card may contain the architecture of the switch or router in FIG. 6. Further embodiments are possible wherein the switch, router, or line cards may each contain multiple processors that distribute the load.

VDCs allow software fault isolation across different logical instances. A fault in one logical instance does not affect any other logical instance. Therefore, the effects of a fault are contained within a single logical instance. This fault could be any kind of software service crash, kernal crash, misconfiguration, security attack, or contamination of resource such as file system corruption. VDCs can greatly improve the stability of the physical device.

Since each VDC may run a different instance of an image, it is also possible to upgrade or patch an individual service or an entire image for the VDC without taking other services offline. This also allows administrators to fix certain software bugs, for example, topology-related bugs, as each VDC could be a part of a different network topology. Since two VDCs are independent of one another, each VDC can run a different software version, thus providing flexibility for customers to test new versions of software on the same hardware device without affecting their production network.

The independent nature of each VDC also allows a new high availability redundancy model of Active-Active for control processes. Usually a high availability model uses one hot standby supervisor to run software services in standby mode. The standby supervisor takes over the function of the active supervisor in case of software failure based on predefined policies. By using VDCs, both the supervisor and the backup can actually be in active mode at the same time and the user can even selectively fail over a VDC to different hardware if desired. This allows a much more flexible high availability model. It also allows users to utilize all the hardware resources on a device. Even in the case of a single supervisor it is possible to have an active-standby model where a VDC could be acting as a standby of the same supervisor.

Furthermore, a supervisor-user of the physical device can assign resources to a VDC. Resources could be any physical resource such as interfaces, cpu, memory, TCAM space, L2 VLANs, routing information learnt, etc. Once resources are assigned to VDC, it may be managed only through the VDC context.

Additionally, each VDC may have its own configuration and authentication domain which could be independent and different from the physical device. All the management and system messages may also be localized to the VDC. This provides isolation in hosted environments where a user may want to hide configuration from other users who are co-hosted on the same physical device. This allows for greater flexibility in hosted environments where multiple administrators are co-hosted on one physical device.

VDCs also provide service differentiation across logical instances. VDCs allow a user to run service instances on a per-VDC basis and thus enables the user to run different services in each logical instance independent of each other. This also improves reliability of the network across logical instances where, for example, a loop caused by STP in one logical instance would not bring down the entire network.

Although illustrative embodiments and applications of this invention are shown and described herein, many variations and modifications are possible which remain within the concept, scope, and spirit of the invention, and these variations would become clear to those of ordinary skill in the art after perusal of this application. Accordingly, the embodiments described are to be considered as illustrative and not restrictive, and the invention is not to be limited to the details given herein, but may be modified within the scope and equivalents of the appended claims. 

1. A method comprising: receiving an indication of a fault condition relating to a first service running on a physical device in a computer network, wherein the first service is associated with a first virtual device context defined on the physical device; and disabling the first service without affecting operation of a second service on the physical device, wherein the second service is associated with a second virtual device context defined on the physical device.
 2. The method of claim 1, wherein the first virtual device context is associated with one or more services from the group consisting of: layer 2 services, layer 3 services, and storage area network services and the second virtual device context is associated with one or more services from the group consisting of: layer 2 services, layer 3 services, and storage area network services.
 3. The method of claim 1, wherein the first service is a first layer 2 service and the second service is a second layer 2 service operating under a different protocol than the first layer 2 service.
 4. The method of claim 1, wherein the first virtual device context represents a first supervisor process and the second virtual device context represents a second supervisor process, wherein both the first supervisor process and the second supervisor process are active at the same time.
 5. The method of claim 1, wherein the physical device has a plurality of interfaces and wherein the first service is assigned a first subset of the plurality of interfaces and the second service is assigned a second subset of the plurality of interfaces, wherein the first subset and the second subset have no overlapping interfaces.
 6. The method of claim 1, wherein the physical device is a switch or router.
 7. The method of claim 1, wherein the first virtual device context and the second virtual device context are each associated with a shared layer 2 service.
 8. A method comprising: creating a first virtual device context on a physical device in a computer network, wherein the first virtual device context is a first logical grouping of one or more services; and creating a second virtual device context on the physical device, wherein the second virtual device context is a second logical grouping of one or more services, wherein the first virtual device context is capable of being managed independently of the second virtual device context such that resources assigned to either virtual device context are capable of being managed without affecting management of the other virtual device context.
 9. The method of claim 8, further comprising upgrading the first service without interrupting processing of the second service.
 10. The method of claim 8, further comprising: defining both the first virtual device context and the second virtual device contexts as active redundant supervisors.
 11. The method of claim 8, further comprising: configuring the first virtual device context to act as an active supervisor; and configuring the second virtual device context to act as a backup supervisor.
 12. A network device comprising: one or more interfaces; a memory; and a processor configured to receive an indication of a fault condition relating to a first service running on the network device, wherein the first service is associated with a first virtual device context defined in the memory, and further configured to disable the first service running without affecting operation of a second service, wherein the second service is associated with a second virtual device context defined in the memory.
 13. The network device of claim 12, wherein the first virtual device context is associated with one or more services from the group consisting of: layer 2 services, layer 3 services, and storage area network services and the second virtual device context is associated with one or more services from the group consisting of: layer 2 services, layer 3 services, and storage area network services.
 14. The network device of claim 12, wherein the first service is assigned a first subset of the interfaces and the second service is assigned a second subset of the interfaces, wherein the first subset and the second subset have no overlapping interfaces.
 15. The network device of claim 12, wherein the network device is a switch or router.
 16. The network device of claim 12, wherein the first virtual device context and the second virtual device context are each associated with a shared layer 2 service.
 17. A network device comprising: one or more interfaces; a memory; and a processor configured to create a first virtual device context in the memory, wherein the first virtual device context is a logical grouping of one or more services, and further configured to create a second virtual device context in the memory, wherein the second virtual device context is a logical grouping of one or more services, and wherein the first virtual device context is capable of being managed independently of the second virtual device context such that resources assigned to a virtual device context is capable of being managed without affecting management of another virtual device context.
 18. The network device of claim 17, wherein both the first virtual device context and the second virtual device contexts are defined as active redundant supervisors.
 19. The network device of claim 17, wherein the first virtual device context is configured to act as an active supervisor and the second virtual device context is configured to act as a backup supervisor.
 20. An apparatus comprising: means for receiving an indication of a fault condition relating to a first service running on a physical device in a computer network, wherein the first service is associated with a first virtual device context defined on the physical device; and means for disabling the first service without affecting operation of a second service on the physical device, wherein the second service is associated with a second virtual device context defined on the physical device.
 21. An apparatus comprising: means for creating a first virtual device context on a physical device in a computer network, wherein the first virtual device context is a logical grouping of one or more services; means for creating a second virtual device context on the physical device, wherein the second virtual device context is a logical grouping of one or more services; and wherein the first virtual device context is capable of being managed independently of the second virtual device context such that resources assigned to a virtual device context is capable of being managed without affecting management of another virtual device context. 